Nucleic acids encoding dog gastric lipase and their use for the production of polypeptides

ABSTRACT

The present invention relates to dog gastric lipase as obtained by genetic engineering as well as to the nucleotide sequence encoding this recombinant DGL. It also relates to the use of this recombinant DGL for the production of pharmaceutical compositions intended especially for the treatment of pathologies linked to the insufficiency, or even the absence, of secretion of lipases in the body of an individual.

BACKGROUND OF THE INVENTION

The present invention relates to nucleic acids encoding dog gastric lipase (DGL), and other polypeptide derivatives of the latter possessing a lipase activity, as well as their use, especially for the production of these polypeptides. The subject of the invention is also the polypeptides encoded by these nucleic acids, and the use of these polypeptides in pharmaceutical compositions.

DGL is a glycoprotein of about 380 amino acids (AA) of a molecular weight of about 49 kilo-daltons (KD) synthesized in the form of a precursor containing a signal peptide at the amino-terminal (NH₂ -terminal) end and secreted by the median cells of dog stomach fundic mucosa (Carriere F. et al. Eur. J. Biochem. 202 (1991) 75-83).

This enzyme belongs to a family of so-called "preduodenal" lipases of which certain members have already been purified and sometimes even cloned (Docherty A. J. P. et al., Nucl. Ac. res. 13 (1985) 1891-1903; Bodmer M. W. et al., Biochem. Biophys. Act. 909 (1987) 237-244; Moreau H. et al., Biochem. Biophys. Act. 960 (1988) 286-293; European Patents No. 0,191,061 and No. 0,261,016).

For a long time, it was taken for granted that the hydrolysis of food lipids occurred in the small intestine by virtue of the action of enzymes produced by the pancreas (Bernard C., C. R. Acad. Sci. 28 (1849) 249-253).

Observations suggested, however, that the hydrolysis of triglycerides could occur in the stomach by means of preduodenal enzymes (Volhard, F., Z. Klin. Med. 42 (1901) 414-429; Shonheyder, F. and Volquartz, K. Acta Physiol. Scand. 9 (1945) 57-67). These enzymes, and in particular dog gastric lipase, have enzymatic and physico-chemical properties which distinguish them from mammalian pancreatic lipases. These differences between gastric and pancreatic lipases essentially relate to the following points: molecular weight, amino acid composition, resistance to pepsin, substrate specificity, optimum pH for action, and stability in acidic medium.

Furthermore, in vitro, under certain conditions, a synergy of action between gastric and pancreatic lipases can be detected on the hydrolysis of long-chain triglycerides (Gargouri, Y. et Al., Biochem. Biophys. Act. 1006 (1989) 255-271).

Several pathological conditions (cystic fibrosis, pancreatic exocrine insufficiency) are known where the patients totally or partially lack pancreatic exocrine secretion and therefore the enzymes necessary for the hydrolysis of foods (amylases, lipases, proteases). The non-absorption of fats at the intestinal level, and especially long-chain triglycerides, results in a very substantial increase in steatorrhea in these patients and in a very substantial slowing down of weight gain in young patients. In order to overcome this, pig pancreatic extracts are administered to these subjects at the time of meals. The therapeutic efficacy of these extracts could be greatly improved by the co-prescription of DGL by virtue of its specificity of action on long-chain triglycerides.

The purification and the determination of the NH₂ -terminal sequence of DGL are described in the article by F. Carriere which appeared in Eur. J. Biochem. 201, 75-83, 1991. A process permitting the extraction of this enzyme from dog stomachs is also described in this publication. This process consists essentially in subjecting dog stomachs to an extraction by an acidic aqueous medium (pH 2.5); the lipase extract is precipitated by addition of water-soluble salts, then by a filtration on a molecular sieve, followed by a separation by ion-exchange chromatographies, as well as by gel filtration, and an elution fraction containing the lipase is recovered. The purified DGL obtained by these processes has a molecular weight according to the Laemmli technique of 49,000 daltons, of which 6000 correspond to sugars and 43,000 to a protein.

Obvious reasons of difficulties of supply of dog stomachs prevent any development of this process both at the laboratory level and at the industrial level, hence the necessity to find a process avoiding the use of dog stomachs, which makes it possible to produce DGL in a large quantity.

SUMMARY OF THE INVENTION

The aim of the present invention is precisely to permit the production of DGL on an industrial scale by removing any problem of supply of raw material, and at an advantageous cost price.

The invention stems from the discovery made by the inventors of the nucleotide sequence of the messenger RNA (mRNA) encoding DGL, after cloning of the complementary DNA (cDNA) of this mRNA by means of a probe corresponding to the nucleotide sequence of the rabbit recombinant gastric lipase described in the French patent application filed on 13 Nov. 1991 and published under the number 2 683 549.

The present invention relates to a nucleic acid that is constituted by a first DNA fragment represented in FIG. 8 (SEQ ID NO 1), or a second DNA fragment delimited by the nucleotides situated at positions 1 and 1137 (SEQ ID NO 2) of the DNA represented in FIG. 8, wherein either of the first or second DNA fragments encodes the polypeptide delimited by the amino acids situated at positions 1 and 379 (SEQ ID NO 3) of the amino acid sequence represented in FIG. 9A, this polypeptide corresponding to dog gastric lipase.

The invention also relates to a recombinant nucleic acid comprising one of the nucleic acids described above which is inserted into a nucleotide sequence which is heterologous with respect to such nucleic acids. This recombinant nucleic acid comprises a promoter situated upstream of the nucleic acid under whose control the nucleic acid is transcribed, as well as a sequence encoding signals for termination of transcription which is situated downstream of the nucleic acid.

Other embodiments of the invention relate to a recombinant vector, a host cell and a process for preparing a polypeptide. The vector comprises one of these recombinant nucleic acids and elements necessary for promoting and controlling the expression of these nucleic acids in a host cell, and more particularly to a promoter recognized by the polymerases of the host cell. The host cell, particularly of the prokaryotic or eukaryotic type, is transformed by the recombinant vector. This host cell comprises one of the recombinant nucleic acids defined herein and the regulatory elements which permit the expression of these nucleic acids. Also disclosed is a process for the preparation of a polypeptide encoded by a nucleic acid, comprising the steps of culturing one of these host cells in an appropriate culture medium, and recovering the polypeptide produced by the host cell, either directly from the culture medium, or after lysis of the host cell.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will be more particularly illustrated by means of FIGS. 1 to 12 whose legends are the following:

FIG. 1. Polypeptide sequences of the cleavage region of rabbit (SEQ ID NO: 12), human (SEQ ID NO: 13) and rat (SEQ ID NO: 14) gastric lipase precursors, and comparison with the NH₂ -terminal sequence of dog gastric lipase (SEQ ID NO 11).

FIG. 2A. Design of a degenerate oligonucleotide (DGL₁) encoding the cleavage region of the DGL precursor from comparison of its rabbit (SEQ ID NO: 20), human (SEQ ID NO: 19), and rat (SEQ ID NO: 21) homologs.

FIG. 2B. Sequence of the oligonucleotides DGL₁ (SEQ ID NO 7) and DPL₂ (SEQ ID NO 8).

FIG. 3. Map of the clone 3.12.

FIG. 4. Scheme for cloning DGL into the vector pBluescript KS(+) and subcloning of the "H" fragment of the clone 3.12. into pKSPCR: production of the clone pKSDGL10.

FIG. 5. Restriction map of the plasmid vector pRU303.

FIG. 6. Nucleotide sequence (SEQ ID NO: 15) of the EcoRI-NdeI DNA fragment of the plasmid pRU303.

FIG. 7. Subcloning of the cDNA of dog gastric lipase into the expression vector pRU303 and construction of the plasmid pDGL5.303.

FIG. 8. Nucleotide sequence of the cDNA encoding the mature DGL (SEQ ID NO 1) This FIG. 8 includes drawings 8A, 8B and 8C.

FIG. 9A. Polypeptide sequence of the mature DGL (SEQ ID NO 3) This FIG. 9A includes drawings 9A1 and 9A2.

FIG. 9B. Comparison of the polypeptide sequences of HGL (human gastric lipase) (SEQ ID NO 16) and DGL, and determination of the % homology This FIG. 9B includes drawings 9B1 and 9B2.

FIG. 9C. Comparison of the polypeptide sequences of RATLL (rat lingual lipase) (SEQ ID NO 17) and DGL, and determination of the % homology This FIG. 9C includes drawings 9C1 and 9C2.

FIG. 9D. Comparison of the polypeptide sequences of RGL (rabbit gastric lipase) (SEQ ID NO 18) and DGL, and determination of the % homology This FIG. 9D includes drawings 9D1 and 9D2.

FIG. 10. Mutagenesis in vitro of the cDNA of DGL by the "PCR" technique by means of oligonucleotide primers DGL₂ (SEQ ID NO 9) and DGL₃ (SEQ ID NO 10) for the construction of the plasmid pDGL5.303.

FIG 11. Analysis by SDS-polyacrylamide gel electrophoresis of the proteins synthesized, in the absence or in the presence of IPTG, in E. coli W3110 I^(q) transformed with the plasmid pDGL5.303.

FIG. 12. Immunodetection by means of specific antibodies of the DGL synthesized in E. coli W3110 I^(q) transformed with the plasmid pDGL5.303 after "Western" type transfer onto nylon membrane of the proteins derived from these bacteria.

THE DETAILED DESCRIPTION OF THE INVENTION

Present invention relates to any nucleic acid characterized in that it comprises all or part of the DNA fragment represented in FIG. 8 (SEQ ID NO 1), and more particularly all or part of the DNA fragment delimited by the nucleotides situated at positions 1 and 1137 (SEQ ID NO 2) of the DNA represented in FIG. 8, this DNA fragment encoding the polypeptide delimited by the amino acids situated at positions 1 and 379 of the amino acid sequence represented in FIG. 9A (SEQ ID NO 3), this polypeptide corresponding to the mature DGL.

The expression DGL, above and below, is understood to mean any lipase secreted by the gastric mucous membrane or by a pregastric mucous membrane in dogs.

The abovementioned nucleic acids may also comprise, upstream of the DNA fragment delimited by the nucleotides situated at positions 1 and 1137, of FIG. 8, a DNA fragment (more particularly a sequence ATG) encoding a methionine (SEQ ID NO 4).

The invention also relates to the abovementioned DNA fragments having, upstream of position 1137, a STOP codon, especially that consisting of the sequence delimited by the nucleotides situated at positions 1138, 1139 and 1140 of FIG. 8 (SEQ ID NO 6).

DGL, like all gastric lipases purified or cloned up until now, is synthesized in the form of a precursor consisting of a signal peptide preceding the polypeptide sequence of the mature protein.

Generally, the invention relates to any nucleic acid characterized in that it comprises, upstream of one of the abovementioned DNA fragments, a nucleotide sequence encoding a signal peptide.

As opposed to the double-stranded nucleic acids mentioned above, the invention also relates to the single-stranded nucleic acids consisting of either of the two complementary nucleotide sequences constituting the abovementioned DNA fragments.

The invention also relates to any nucleic acid capable of hybridizing with a single-stranded nucleic acid as described above, especially under the hybridization conditions mentioned in the following detailed description of the cloning of the cDNA of the DGL according to the invention.

Any nucleic acid encoding a polypeptide according to the invention and whose nucleotide sequence differs, according to the degeneracy of the genetic code, from the abovementioned nucleotide sequences, also enters within the scope of the present invention.

The subject of the invention is also any recombinant nucleic acid characterized in that it comprises a nucleic acid as described above according to the invention, inserted into a DNA molecule which is heterologous with respect to the abovementioned nucleic acid.

In this respect, the subject of the invention is more particularly any recombinant nucleic acid comprising a promoter situated upstream of the nucleic acid according to the invention, and under the control of which the transcription of the said nucleic acid is capable of being carried out, as well as a sequence encoding signals for termination of transcription which is situated downstream of the said nucleic acid.

The invention also relates to any recombinant vector, especially of the plasmid, cosmid or phage type, characterized in that it contains a recombinant nucleic acid as described above, inserted at one of its sites which are non-essential for its replication.

The recombinant vectors according to the invention are advantageously characterized in that they contain, at one of their sites which are non-essential for their replication, elements necessary for promoting and controlling the expression of a nucleic acid according to the invention in a host cell, and more particularly a promoter recognized by the polymerases of the cellular host, especially an inducible promoter.

The invention also relates to any host cell, of the prokaryotic or eukaryotic type, transformed by a recombinant vector as described above, and comprising the regulatory elements permitting the expression of a gene or a cDNA according to the invention.

By way of examples of host cells capable of being transformed by a recombinant vector according to the invention, there may be mentioned mammalian cells such as COS or CHO cells, cells of insects capable of being infected by a recombinant virus of the baculovirus type, filamentous fungi such as Aspergillus niger or oryzae, yeasts such as Saccharomyces cerevisiae or Kluyveromyces lactis, as well as bacteria such as E. coli (Gram-negative bacterium) or B. subtilis (Gram-positive bacterium).

The subject of the invention is also DNA (or RNA) primers which can be used for the synthesis of nucleic acids according to the invention by the DNA chain amplification technique, designated below by PCR (Polymerase Chain Reaction) technique. This technique is more particularly described in American Pat. Nos. 4,683,202 and No. 4,683,195, as well as in European Patent No. 200,362. The primers according to the invention advantageously consist of about 15 to 40 nucleotides corresponding to the 3' and 5' ends of either of the two strands constituting the abovementioned DNA fragments.

The invention also relates to nucleotide probes derived from either of the two strands constituting the abovementioned DNA fragments of the invention, as well as the use of these probes, especially for the detection in a biological sample of the possible presence of DGL.

Advantageously, the probes of the invention consist of about 17 to 23 nucleotides. The detection of the presence of DGL in a sample is preferably performed after amplification of the number of copies of the DGL-encoding genes or mRNAs which may be present in this sample, by means of the primers indicated above.

In this respect, the invention also relates to a kit for implementing the abovementioned method of detection, comprising:

where appropriate, primers as described above, as well as the reagents for the preparation of a medium suitable for carrying out the amplification of the DNA or RNA sequence encoding DGL,

a nucleotide probe as described above, labeled where appropriate, especially radioactively or enzymatically, as well as the reagents for the preparation of a medium suitable for carrying out the hybridization between the probe and the abovementioned DNA or RNA sequence,

the reagents permitting the detection of the probe hybridized with the said sequence.

Advantageously, the nucleotide probes of the invention are capable of hybridizing both with the DNA or RNA sequence encoding DGL and with those encoding human gastric lipase (HGL) and rabbit gastric lipase (RGL). Such probes can be used for the implementation of a method of detection in vitro of the possible presence of HGL in a biological sample capable of containing the latter. Such a method of detection is carried out in the manner indicated above, and permits the in vitro diagnosis of pathologies linked to the overproduction, or conversely, to the insufficiency, or even the absence, of production of gastric lipase in the body.

The subject of the invention is also the polypeptides corresponding, according to the universal genetic code, to the nucleic acids according to the invention described above, or any fragment of these recombinant polypeptides, or any polypeptide modified by substitution and/or addition and/or suppression of one or more amino acids of these recombinant polypeptides, these modified fragments or polypeptides preserving the enzymatic properties of the abovementioned recombinant polypeptides.

"Recombinant polypeptide" should be understood to mean any molecule possessing a polypeptide chain capable of being produced by genetic engineering, by transcription and translation of a corresponding DNA sequence under the control of appropriate regulatory elements inside an effective host cell. Consequently, the expression "recombinant polypeptides" does not exclude the possibility that these polypeptides have undergone pos-translational sic! modifications such as glycosylation.

The term "recombinant" implies the fact that the polypeptide has been produced by genetic engineering, more particularly because of the fact that this polypeptide results from the expression in a host cell of nucleic acid sequences which have been previously introduced into an expression vector used in the said host.

However, it should be understood that this expression does not exclude the possibility that the polypeptide is produced by a different process, for example by conventional chemical synthesis according to the conventional methods used for the synthesis of proteins, or by cleavage of larger-sized molecules.

The invention also relates to the abovementioned polypeptides in biologically pure form. The expression "biologically pure" should be understood to mean, on the one hand, a degree of purity enabling the recombinant polypeptide to be used for the production of pharmaceutical compositions and, on the other hand, the absence of contaminants, more particularly of natural contaminants.

In this respect, the invention more particularly relates to:

the polypeptide delimited by the amino acids situated at position 1 and 379 of the amino acid sequence represented in FIG. 9A (SEQ ID NO 3), and corresponding to the mature DGL as obtained by genetic engineering, and whose molecular weight varies from about 43,200 to about 50,000 daltons, according to whether the host in which it is produced carries out post-translational modifications on the polypeptide chain of this DGL,

the abovementioned polypeptides whose amino acid sequences are preceded by a methionine (SEQ ID NO 5).

Advantageously, the abovementioned polypeptides according to the invention, and more particularly the recombinant DGL, possess a lipolytic activity of between about 50 U/mg of polypeptide and about 750 U/mg of polypeptide, and preferably greater than 250 U/mg of polypeptide when measured by means of a short-chain triglyceride (such as tributyrin), as substrate according to the Gargouri method (more particularly described in the detailed description which follows from the invention). One unit U corresponds to the quantity of enzyme necessary to liberate one μmol of H⁺ ions (that is to say of free fatty acids) per minute at 37° C.

The maximum lipolytic activity of the recombinant polypeptides, according to the invention, on long-chain fatty acids is advantageously obtained at pH values of 3 to 5.

According to another advantageous aspect of the recombinant polypeptides of the invention, their lipolytic activity remains unchanged after incubation lasting for one hour at pH 2 and at 37° C.

The present invention also relates to a process for the preparation of a polypeptide as described above, this process comprising the following sequence of steps:

the culture of a host cell, transformed by a recombinant vector as described above, in an appropriate culture medium, and

the recovery of the polypeptide produced by the said host cell, either directly from the abovementioned culture medium, when the sequence encoding the said polypeptide is preceded by a signal sequence and the host cell is capable of secreting the polypeptide into the culture medium (especially in the case of eukaryotic cells and yeasts), or after lysis of the host cell (especially in the case of bacteria).

Where appropriate, the recovery step is followed by a step of purification of the recovered polypeptide, and especially after recovery by lysis of the bacterium by a step for solubilization of the polypeptide, then its rematuration.

The agents and techniques for solubilization of polypeptides obtained in the form of inclusions are well known to persons skilled in the art. Essentially, the solubilizing agents are urea, quaternary ammonium halides such as guanidinium chloride or cetyltrimethylammonium chloride which are used in experimental procedures such as those described by N. K. Puri et al. in Biochem. J (1992) 285, 871-879.

Advantageously, as already specified above, the nucleotide sequences encoding the polypeptides whose production is desired, and inserted into the vector used to transform the host cells, are preceded by a signal sequence thus permitting the secretion of the polypeptides produced outside the host cells and their recovery directly from the culture medium without having to carry out the lysis of the said host cells.

By way of example, it will be possible to obtain the synthesis of the mature DGL in mammalian cells such as COS cells or CHO cells by inserting the nucleic acid encoding the DGL precursor into an appropriate expression vector.

The presence of the DNA segment encoding the signal peptide will permit the cellular machinery to glycosylate in the endoplasmic reticulum and to secrete the DGL in the culture medium in biologically active form.

Alternatively, it will be possible to obtain the production of dog gastric lipase by insect cells by inserting the cDNA encoding DGL or its precursor behind an appropriate promoter in the genome of a virus of the baculovirus type which is capable of infecting the said cells.

In order to cause DGL to be produced and secreted by a yeast such as Saccharomyces cerevisiae or Kluyveromyces lactis, it will be preferable to replace, in the cDNA, the DNA segment encoding the signal peptide of the DGL by a DNA fragment encoding a signal peptide of yeast protein. The recombinant cDNA thus obtained will then be introduced into an expression vector specific for the host considered. Such expression systems are now relatively common. For example, there may be mentioned the expression of human serum albumin (European Patent No. 0,361,991 A2) or calf chymosin (Van den Berg J. A. et al., Biotechnology 8 (1990) 135-139).

Escherichia coli is a Gram-negative bacterium having a wall, in which the phenomena of secretion of proteins into the culture medium are extremely reduced. A certain number of proteins accumulate in the bacterial periplasm by virtue of the presence of signals similar to the signal sequences of eukaryotic proteins. Among the latter, there may be mentioned the products of the phoA and malE genes for example. Certain regions of these genes have been used to produce heterologous proteins in the periplasmic space of E. coli. However, the synthesis in the cytoplasm of foreign proteins remains the best-known system and the most frequently used in E. coli.

The observance of certain rules deduced from experience during the production of plasmid constructs makes it possible to optimize the level of expression of the proteins of interest.

In a first stage, it will be appropriate to place the cDNA encoding the mature part of DGL, that is to say lacking the segment encoding the signal peptide, behind a powerful bacterial or phage promoter. To avoid problems of possible toxicity of the foreign protein in the bacterium, a promoter will be preferably chosen which is inducible by a chemical agent (Lac or Trp promoters) or by a physical agent such as change of temperature (P_(L) promoter and cI857 repressor). The cDNA should be contiguous, in its 5' terminal region, to an ATG sequence specifying the initiation of protein synthesis on the messenger RNA. This initiator ATG should be preceded, at a distance of 6 to 12 base pairs, by a region rich in purines, called Shine-Dalgarno region, and corresponding, on the messenger RNA, to the ribosome-binding site.

It will be possible to modify the composition of the sequence of the DNA segment situated between the Shine-Dalgarno region and the initiator ATG so as to reduce the elements of secondary structure around the AUG initiation codon on the messenger RNA. Once the necessary modifications have been made, the vector will be introduced into an appropriate host.

The invention relates to the antibodies directed against the polypeptides of the invention, and more particularly those directed against DGL and capable of also recognizing HGL and RGL. Such antibodies can be obtained by immunization of an animal with these polypeptides followed by the recovery of the antibodies formed.

It goes without saying that this production is not limited to polyclonal antibodies.

It also applies to any monoclonal antibody produced by any hybridoma capable of being formed, by conventional methods, from the spleen cells of an animal, especially mouse or rat, which are immunized against one of the purified polypeptides of the invention, on the one hand, and the cells of an appropriate myeloma on the other hand, and being selected for its capacity to produce monoclonal antibodies recognizing the polypeptide initially used for the immunization of the animals, as well as HGL.

The invention also relates to the use of these antibodies for the implementation of a method of detection or assay of DGL or of HGL in a biological sample capable of containing it.

The invention more particularly relates to the use of these antibodies for the implementation of a method of diagnosis in vitro of pathologies linked to the overproduction, or conversely, to the insufficiency, or even the absence of production of lipase in the body.

This in vitro diagnostic method, performed using a biological sample collected from a patient, comprises a step of bringing into contact with this sample, followed by a step of detection of the possible antibody-HGL complexes formed during the preceding step.

In this respect, the invention also relates to a kit for implementing a method of detection or diagnosis in vitro mentioned above, comprising:

antibodies as described above, advantageously labeled radioactively or enzymatically, as well as the reagents for the preparation of a medium suitable for carrying out the immunological reaction between these antibodies and HGL,

the reagents permitting the detection of the immunological complexes formed between these antibodies and HGL.

The invention also relates to the use of one or more polypeptides described above, for the production of pharmaceutical compositions which can be used especially orally, intended to facilitate the absorption of the animal or vegetable fats ingested by a healthy individual or an individual suffering from one or more pathologies affecting or otherwise the level of production of gastric lipase. In particular, such compositions are advantageously used in individuals undergoing a medical treatment altering the mechanism of absorption of fats, or alternatively in elderly persons.

The invention relates more particularly to the use of one or more polypeptides described above for the production of medicinal products intended for the treatment of pathologies linked to the insufficiency, or even the absence, of production of lipases in the body, and more particularly of pathologies such as cystic fibrosis, and pancreatic exocrine insufficiency.

The subject of the invention is also pharmaceutical compositions comprising at least one polypeptide according to the invention, where appropriate in combination with one or several other polypeptides with lipase activity, in combination with a pharmaceutically acceptable vehicle.

The pharmaceutical compositions according to the invention are preferably administrable orally, and are provided especially in the form of hard gelatin capsules, tablets or powders for dilution.

The daily dosage in man is advantageously of about 400 mg to about 1,200 mg, preferably divided during the main meals, equivalent to an amount of about 130 mg to about 400 mg per meal.

The invention also relates to the use of the polypeptides as described above according to the invention or any other mammalian gastric lipase and derivatives of the said polypeptides, for the implementation of enzymatic bioconversion reactions (such as enzymatic hydrolyses or transesterifications), especially in immobilized form on a solid support.

In the case of the preparation of the nucleic acids of the invention, the latter can be carried out chemically, especially according to one of the following processes.

An appropriate mode of preparation of the nucleic acids (comprising a maximum of 200 nucleotides) of the invention by the chemical route comprises the following steps:

the synthesis of DNA using the automated β-cyanethyl phosphoramidite method described in Bioorganic Chemistry 4; 274-325, 1986,

the cloning of the DNAs thus obtained into an appropriate plasmid vector and the recovery of the DNAs by hybridization with an appropriate probe.

A mode of preparation, by the chemical route, of nucleic acids of length greater than 200 nucleotides, comprises the following steps:

the assembly of chemically synthesized oligonucleotides, provided at their ends with various restriction sites, whose sequences are compatible with the amino acid linkage of the natural polypeptide according to the principle described in Proc. Nat. Acad. Sci. USA 80; 7461-7465, 1983,

the cloning of the DNAs thus obtained into an appropriate plasmid vector and the recovery of the desired nucleic acid by hybridization with an appropriate probe.

The invention will be more particularly illustrated with the aid of the following detailed description of the construction of recombinant vectors according to the invention and their use for the production of DGL.

An RNA preparation was prepared from mucosa isolated from the fundic region of dog stomach. The messenger RNAs isolated by affinity chromatography on an oligo-dT cellulose column were converted into complementary DNA (cDNA) by the use of specific enzymes: Rous Sarcoma Virus reverse transcriptase and E. coli DNA polymerase I (Klenow fragment). This cDNA was introduced into the vector pUC18 after certain modifications and the recombinant molecules were used to transform the bacterium E. coli MM294. The transformant clones were screened by in situ hybridization by means of a probe containing the cDNA of radioactively labeled rabbit gastric lipase. After autoradiography, the bacterial colonies corresponding to a positive signal during the hybridization experiment were isolated and the plasmid DNA present in their cytoplasm amplified and purified.

After screening of the clones obtained, the clone 3.12 was selected and sequenced. This clone contains a PstI-PstI insert of 1201 base pairs, which insert is itself divided into two unequal parts H and L by a PstI restriction site.

No clone containing the complete cDNA could be detected at this stage.

In order to isolate the clone containing the cDNA encoding the mature dog lipase, an additional technique was used.

An mRNA fraction derived from the starting preparation is converted into single-stranded cDNA by means of the enzyme reverse transcriptase and an oligo- nucleotide primer DPL₂ (FIG. 2B) obtained from the 3' terminal sequence of the cDNA contained in the clone 3.12 previously isolated and sequenced.

The cDNA encoding the mature part of the DGL is then obtained and amplified by the PCR method, in the presence of Taq Polymerase and two oligonucleotide primers, DPL₂ as mentioned above, and DGL₁ designed from comparison of the 5' terminal nucleotide sequences of human and rabbit gastric lipases, of rat lingual lipase and of the known NH₂ -terminal protein sequence of DGL.

The double-stranded cDNA thus obtained was introduced into the vector pBluescript KS(+) after certain modifications and the recombinant molecules were used to transform the bacterium E. coli MM294. The transformant clones were screened by PCR using oligonucleotide probes corresponding to the parts of the sequence of the vector pBluescript KS(+) situated on either side of the insert. The clone pKSPCR containing an insert of 700 base pairs was selected and sequenced.

At the same time, after digestion with the restriction enzyme PstI, the "H" fragment of the cDNA insert of the clone 3.12 as obtained earlier is inserted into the plasmid pKSPCR linearized with PstI; a clone pKSPCR 10 is obtained which contains the cDNA encoding the mature dog gastric lipase.

Analysis of the nucleotide sequence of this cDNA made it possible to detect an open reading frame of 1137 nucleotides (NT) corresponding to a protein of 379 AA and a molecular weight of 43222 daltons.

Comparison with the nucleotide sequences of the other preduodenal lipases (Docherty, A. P. J. et al. (1985) op. cit.; Bodmer, M. W. et al. (1987) op. cit.; Moreau, H. et al. (1988) op. cit.) reveals a homology of 84.7% with HGL, and of 75.7% with RATLL and 81% with RGL in the coding regions.

Alternatively, a second process can be used for the production of a clone containing the cDNA encoding the mature DGL.

In the case where the mRNAs extracted from dog stomach mucosa, isolated by affinity chromatography on an oligo-dT column, and converted into cDNA by virtue of the use of specific enzymes (Rous Sarcoma Virus reverse transcriptase and E. coli DNA polymerase I) correspond to the whole mRNA of the mature DGL or its precursor, it will be possible to introduce the cDNA thus obtained after certain modifications in the vector pUC18 and the recombinant molecules used to transform a host cell, preferably bacterium or yeast; the transformant clones will be screened by in situ hybridization using probes derived from rabbit gastric lipase.

After autoradiography, the colonies of host cells corresponding to a positive signal during the hybridization will be isolated and the plasmid DNA present in the cytoplasm of these cells amplified and purified. Advantageously, the general cloning techniques used in this second process will be the same as those used in the process described earlier.

General Cloning Techniques

The conventional molecular biology methods such as purification of the messenger RNAs, the extraction and purification of plasmid DNA, the digestion with restriction enzymes, electrophoresis on agarose or polyacrylamide gel, electroelution from agarose gel of DNA fragments, transformation in E. coli, are described in the literature (Maniatis, T. et al., "Molecular cloning: a laboratory manual, Second Edition", Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989; Ausubel, F. M. et al. (eds.), "Current Protocols in Molecular Biology", John Willey and Sons, New York, 1987).

The "random priming" is performed according to the method described by Feinberg and Wogelstein (Anal. Biochem. (1983) 132: 6; Anal. Biochem. (1984) 137: 266).

The enzymes are obtained from the Companies Boehringer or New England Biolabs and used under the conditions recommended by the suppliers.

The DNA fragments intended to be assembled are separated according to their size by electrophoresis on 1% agarose gel, purified by electroelution and precipitated with ethanol. The ligation of the DNA fragments is carried out in the presence of T₄ DNA ligase at 4° C. or at 16° C. in an appropriate buffer according to whether the pieces to be assembled possess blunt or cohesive ends.

The sequencing of the DNA is carried out according to the dideoxynucleotide method (Sanger, F. et al., Proc. Natl. Acad. Sci. USA. 74 (1977) (5463-5467) using a "T7 sequencing" kit (Pharmacia).

The enzymatic amplification of specific DNA fragments is carried out according to the "Polymerasecatalysed Chain Reaction" or PCR method (Saiki, R. K. et al., Science 220 (1985) 1350-1354) using a PREM III LEP Scientific apparatus.

The oligonucleotides used as primers in the PCR or sequencing reactions are synthesized using a PCR-MATE Model 391 DNA synthesizer (Applied Biosystems) and purified by high-performance liquid chromatography before they are used.

The recombinant DNA molecules are used to transform competent cells of the following strains of E. coli:

MM294 F-, endA1, hsdR17 (r_(k) -m_(k) +), supE44, thi-1, relA1!, or

W3110 I^(q) F'TraD36, LacI^(q), (lac Z)M15, pro⁺ !.

The plasmid DNA is extracted from the bacterial transformants resistant to ampicillin according to a procedure derived from the alkaline lyse method described by Birnboin and Doly (Birnboim, H. C. and Doly, J., Nucl. Ac. Res. 7 (1979) 1512-1523).

The immunodetection of the dog gastric lipase synthesized in the bacterium E. coli W3110 I^(q), after addition of IsoPropylThioGalactopyranoside (IPTG) to the culture medium, is carried out by an immunoblotting method onto nylon membrane using an anti-DGL guinea-pig antibody and the kit for revealing with ImmunoPure ABC peroxidase (Pierce).

The preparation of the DGL is advantageously illustrated, although with no limitation being implied, by the following example of expression of DGL in the bacterium E. coli W3110 I^(q).

The process for the preparation of the lipase comprises several steps which are detailed in the following text:

Step No. 1: Cloning of a cDNA encoding dog gastric lipase.

1.1. Isolation and purification of the messenger RNAs from dog stomach fundic mucosa.

After grinding the tissues in a buffer containing lithium chloride and urea (Auffray, C. and Rougeon, F., Eur. J. Biochem. 107 (1980) 303-314), the total RNA is separated from the DNA by selective precipitation with lithium chloride. The proteins contaminating the RNA are then removed by phenol extraction. The messenger RNAs, polyadenylated at their 3'OH end, are separated from the ribosomal RNAs by chromatography on an oligo-dT cellulose column (Maniatis, T. et al., already cited). 75 micrograms of messenger RNA are thus obtained per gram of tissue.

1.2. Detection of the messenger RNA encoding DGL in the messenger RNA preparation extracted from dog stomach fundic mucosa.

The dog gastric lipase was purified to homogeneity and its NH₂ -terminal polypeptide sequence determined (Carriere F. et al., already cited).

A probe consisting of the DNA encoding the rabbit lipase precursor is used to verify the presence of an mRNA encoding DGL in the preparation obtained.

A "Northern" type hybridization experiment is carried out. A sample of 20 μg of dog stomach messenger RNA is denatured at 60° C., in the presence of glyoxal and DMSO, and then the mRNAs are separated according to size, by electrophoresis, on a 1% agarose gel in 10 mM phosphate buffer pH=7 (Thomas, P., Proc. Natl. Acad. Sci. USA, 77 (1980) 5201-5205).

After electrophoresis, the messenger RNA is transferred onto nylon membrane (Biodyne PALL) according to the procedure recommended by the supplier.

The cDNA fragment corresponding to the rabbit lipase is labeled by "random priming". The membranes previously obtained are hybridized individually for 36 hours at 37° C. in a 5× SSC buffer -5× Denhardt--50 mM sodium phosphate, pH=6.5--0.1% SDS--50% formamide, containing 10 ng/ml of the radioactive probe (Ausubel, F. et al. (eds), already cited). The temperatures used take into account the possible sequence homologies between RGL and DGL. An mRNA of about 1700 nucleotides hybridizes with the radioactive probe.

1.3. Synthesis of complementary DNA from dog stomach mRNA and insertion into the vector pUC18.

The synthesis of double-stranded cDNA is carried out starting with 4 μg of polyA+ RNA, in the presence of 50 units of AMV reverse transcriptase, 100 ng of an oligo-dT primer and E. coli DNA polymerase I.

A fraction of this DNA is inserted into the vector pUC18 by means of oligo-dC and oligo-dG tails, which are added respectively onto the cDNA and onto the vector previously linearized with the enzyme PstI (Gubler, U. and Hoffman, B. J., Gene 25 (1983) 263-269).

The hybrid molecules are used to transform competent bacteria E. coli MM294. The selection of the transformants is carried out by plating the product of the transformation onto a solid nutrient medium (LB-Agar) containing ampicillin at 50 mg/liter.

1.4. Isolation of the cDNA encoding DGL.

The bacterial colonies derived from the transformation are transferred onto nylon membranes (Biodyne PALL) and lysed according to a process recommended by the supplier. The effect of this operation is to denature and to bind onto the membrane the bacterial and plasmid DNA contained in the colonies.

After several washes in a 3× SSC buffer--0.1% SDS, at room temperature and then at 65° C., the filters are prehybridized for two hours at 65° C. in a 6× SSC buffer--10× Denhardt--0.1% SDS, and then hybridized at 50° C. in the same buffer containing the rabbit probe labeled with ³² P by "random priming", at the rate of 0.5 μci per ml of buffer.

The filters are washed in a 2× SSC buffer--0.1% SDS at room temperature then at 50° C. in the same buffer, before being subjected to autoradiography for 24 to 48 hours.

A screening of the colonies is carried out on two series of filters with the rabbit probe. The clone 3.12 is thus obtained (FIG. 3).

1.5. Synthesis of cDNA by means of the specific oligonucleotide DPL₂ and insertion into the vector pBluescript KS(+).

1.5.a. Synthesis of cDNA by means of the specific oligonucleotide DPL₂.

A synthesis of single-stranded cDNA is carried out starting with 5 μg of polyA+RNA, in the presence of 50 units of AMV reverse transcriptase and 100 ng of a synthetic oligonucleotide DPL₂ specific for DGL, and corresponding to the sequence in 3' of the cDNA contained in the clone 3.12 described earlier. After extraction of the solution with phenol-chloroform and precipitation in alcohol, the pellet obtained is dissolved in 20 microliters of distilled water; the single-stranded cDNA in solution is then amplified and converted into double-stranded DNA by the "PCR" technique by means of the primers DGL₁ and DPL₂ which are presented in FIG. 2B, so as to be cloned into an appropriate vector. The DGL₁ primer used above consists of a mixture of 12 sequences, each of these sequences corresponding to one of the possible combinations for representation of DGL₁ taking into account the fact that two T nucleotides can be replaced with one C nucleotide, and that one G nucleotide can be replaced with one T nucleotide or one A nucleotide at the positions indicated below the DGL₁ primer represented in FIGS. 2A and 2B.

1.5.b. Insertion into the vector pBluescript KS(+).

After digestion with the enzyme PstI, the 700 bp fragment of cDNA is inserted into the vector pBluescript KS(+) digested with the restriction enzymes SmaI and PstI.

The recombinant molecules derived from the ligation are used to transform competent bacteria E. coli MM294. The selection of the transformants is carried out by plating the product of the transformation on a solid nutrient medium (LB-Agar) containing ampicillin at 50 mg/liter.

The clone pKSPCR is thus obtained.

1.5.c. Ligation of the "H" fragment of the clone 3.12 into the plasmid pKSPCR.

The clone 3.12 is digested with the restriction enzyme PstI; the PstI-PstI "H" fragment of 850 base pairs of the clone 3.12 corresponding to the 3' region of the DGL cDNA is inserted into the plasmid pKSPCR previously linearized with the enzyme PstI.

The combination of these steps is presented in FIG. 4.

1.6. Isolation of the cDNA from the clone pKS DGL10.

A clone, pKS DGL10, was selected after screening by "PCR" (C. Blanchard and C. Benicourt, Boehringer, "Le brin complementaire", September 1992, No. 8, p6,). The cleavage by restriction enzymes of the plasmid pKSDGL10 shows that it contains a 1.5 Kb insert. This plasmid is prepared from one liter of bacterial culture for its detailed analysis and its sequencing.

The sequencing of the clone is carried out on double-stranded DNA by the Sanger method (Sanger, F. et al., Proc. Natl. Acad. Sci. USA. 74 (1977) 5463-5467).

The complete sequence of the cDNA contains 1528 nucleotides and is presented in FIG. 8. One open reading frame stretching from nucleotide 1 to nucleotide 1137 encodes a protein of 379 AA. The sequence of this protein is presented in FIG. 9A. This protein has 81% homology with rabbit gastric lipase (French Patent No. 91 13948).

Step No. 2: Construction of plasmids to express DGL in Escherichia Coli. 2.1. Choice of expression vector.

The vector chosen to express the DGL in E. coli is a plasmid in which a synthetic DNA fragment of 160 bp containing a Tac type promoter and a transcription terminator has been inserted between the EcoRI and NdeI sites of pBR322 (Bolivar, F. et al., Gene 2 (1977) 95-113). The restriction map of the vector pRU303 is presented in FIG. 5 and the nucleotide sequence of the EcoRI-NdeI DNA fragment in FIG. 6.

2.2. Construction of the plasmid pDGL5.303

In spite of exhaustive studies which have been carried out, few correlations have been established between the level of expression of a heterologous protein in a bacterium and the nucleotide sequence of the 5' terminal region of the messenger RNA of this same protein. However, a number of observations have made it possible to deduce certain empirical rules which can result in higher expression levels in the recombinant bacteria.

Among these "rules", there may be mentioned:

the distance between the Shine-Dalgarno region and the initiator AUG between 6 and 12 nucleotides,

a Shine-Dalgarno sequence rich in purines (AGGA),

a minimum secondary structure between Shine-Dalgarno and initiator AUG,

the absence of secondary structure (double strand) in the regions of messenger RNA containing the Shine-Dalgarno sequence and the initiator AUG.

Such constraints can be taken into account in the analysis programs which make it possible to define the nucleotide sequences of the non-coding 5' regions of the mRNAs capable of resulting in the best levels of expression of particular heterologous proteins, such as DGL in E. coli.

Using specific synthetic primers DGL₂ and DGL₃ which are presented in FIG. 10 and the "PCR" gene amplification technique, the cDNA encoding the mature part of DGL is positioned behind an ATG codon for initiation of translation and placed between nucleotide sequences such that it can be inserted into the expression vector pRU303 between the restriction sites BglII and SalI. Because of the presence, in the construct pDGL5.303, of an ATG codon immediately upstream of the sequences encoding DGL, the recombinant proteins obtained will possess, totally or partially, a methionine at their NH₂ -terminal end.

The recombinant plasmid pDGL5.303 whose construction scheme is represented in FIG. 7 was obtained in the strain E. coli MM294 and then transferred into the strain E. coli W3110 I^(q), which is frequently used for the expression of heterologous proteins. This strain contains the gene for the repressor LacI^(q) situated on a non-transferable episome F': the repressor synthesized in large quantity in the bacterium represses the expression of all the genes placed under the control of a lactose-type promoter.

Step No. 3: Expression of DGL in E. coli.

The plasmid pDGL5.303 was introduced into the host E. coli W3110 I^(q). The bacteria transformed by the plasmid are cultured in medium in the presence of M9 glucose sic! (Maniatis, T. et al., already cited). During the exponential growth phase, the expression of the dog gastric lipase is induced by addition of IPTG at the final concentration of 2 mM.

After 4 hours at 37° C., the bacteria are harvested, centrifuged and washed with PBS buffer. The bacteria are then lysed in a buffer containing SDS and β-mercaptoethanol for 10 minutes at 100° C.

Analysis of the proteins on electrophoresis gel under denaturing conditions makes it possible to detect a protein band which may correspond to the lipase. The protein is expressed at a level such that it can be detected by this technique as shown in FIG. 11.

In order to ensure that this protein, which is induced by the addition of IPTG to the culture medium, indeed corresponds to the DGL, the proteins derived from cultures of bacteria transformed by the plasmid pLGC5.303, induced and non-induced by the chemical agent, are transferred onto a nylon membrane after they have been separated according to their size by SDS-polyacryl-amide gel electrophoresis.

The complex between the DGL and the anti-DGL antibody can be detected by means of a calorimetric reaction involving a second antibody coupled to an enzyme, horseradish peroxidase. The results are presented in FIG. 12.

Rather than being produced in the form of inclusion bodies in the cytoplasm of the bacterium Escherichia coli, dog gastric lipase can be advantageously secreted into the bacterial periplasm by inserting the mature enzyme-encoding cDNA into an appropriate vector, under the control of an inducible promoter by a physical or chemical agent, and downstream of a DNA segment encoding a signal peptide such as that present at the NH2 terminal end of the protein ompA (Movva N. R. et al. J. Biol. Chem. 256: 27-29, 1980).

Dog gastric lipase can also be synthesized in Escherichia coli in the form of a soluble fusion protein with Staphylococcus aureus protein A permitting its subsequent purification. To this end, the mature lipase-encoding cDNA is inserted into the vector pRIT2T (Nilsson B. et al. EMBO J.4: 1075-1080, 1985) which was previously modified in order to introduce therein a DNA fragment encoding the recognition site Ile-Glu-Gly-Arg for coagulation factor Xa. The fusion protein thus produced can be separated from the other proteins of the cytoplasm of the bacterium by affinity chromatography on an IgG-Sepharose column (Pharmacia). After elution of the column, the fusion protein is cleaved by factor Xa. The product of the hydrolysis is again subjected to a chromatography on an IgG-Sepharose column which retains protein A, thus making it possible to obtain the dog gastric lipase in the pure state in soluble form.

It is also possible to obtain dog gastric lipase from mammalian cells in culture. For that, the cDNA encoding the precursor of this lipase should be introduced into an appropriate vector such as the plasmid pCDNAI-Neo (Invitrogen corporation) under the control of the Cytomegalovirus (CMV) promoter, or alternatively the mature lipase-encoding cDNA into the same type of vector, but downstream of a DNA segment encoding the signal peptide of rabbit gastric lipase (Benicourt C. et al.; French Patent Application no. 2,633,549 cited above). The introduction of such recombinant plasmids into monkey kidney COS-7 cells constitutively expressing the SV40 virus T antigen makes it possible to transiently produce dog gastric lipase in the culture medium in an appreciable quantity. Cell lines constitutively expressing dog gastric lipase can be obtained by introducing one of two recombinant plasmids into hamster ovary cells (CHO) and by exerting a selection pressure with the antibiotic G418 or geneticin due to the presence of a gene for resistance to aminoglycosides such as neomycin on the said plasmids.

The detection of the activity of the recombinant DGL, especially that derived from a bacterial lysate obtained from a culture of W3110 I^(q) (pDGL5.303), is carried out by the method of Gargouri et al. (Gastroenterology 91 (1986) 265-275) using tributyrin as substrate.

The experimental conditions in which the specific activities of the recombinant polypeptides are determined will be recalled below.

The specific activity is defined as the ratio of the enzymatic activity to the quantity of proteins in the sample expressed in milligrams. The lipase activity is determined by the titrimetric method of Y. Gargouri (previously cited) in which the substrate used is tributyrin. The assay consists in neutralizing the butyric acid liberated under the action of the lipase by a 0.1N sodium hydroxide solution at constant pH of 6 and at a temperature of 37° C. Under these assay conditions, the enzymatic activity corresponds to the number of micromoles of acid which are liberated in one minute by the action of the product subjected to the assay.

Practically, the assay consists in introducing, into a titration cell thermostated at 37° C.:

Tributyrin: 0.50 ml,

Isotonic solution of bovine serum albumin and sodium taurodeoxycholate 14.50 ml (composition: 100 mg bovine serum albumin, 2 mM sodium taurodeoxycholate, 0.9% isotonic solution of NaCl q.s. one litre).

With electromagnetic stirring and with the aid of an automated titrimeter, the mixture is adjusted to pH 6 by addition of 0.1N sodium hydroxide. After stabilization of the pH at this value, 0.5 to 1 ml of an aqueous solution of the enzymatic compound to be assayed, exactly measured, is added. Under these experimental conditions, the quantity of 0.1N sodium hydroxide solution necessary to maintain the pH at 6 for 2 minutes makes it possible to calculate the lipase activity as defined earlier.

The lipolytic activity can also be measured by the method using a chromogenic substrate such as resorufin 1,2-0-dilauryl-rac-glycero-3-glutarate (Boehringer), which is described in the manufacturer's leaflet.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 21                                                  (2) INFORMATION FOR SEQ ID NO: 1:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1528 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:                                       TTGTTTGGAAAATTACATCCCACAAACCCTGAAGTGACCATGAATATAAGTCAGATGATC60                 ACCTACTGGGGATACCCAGCTGAGGAATATGAAGTTGTGACCGAAGACGGTTATATCCTT120                GGGATCGACAGAATTCCTTATGGGAGGAAAAATTCAGAGAATATAGGCCGGAGACCTGTT180                GCATTTTTGCAACACGGTTTGCTCGCATCAGCCACAAACTGGATCTCCAACCTGCCCAAC240                AACAGCCTGGCCTTCATCCTGGCCGACGCCGGGTACGACGTGTGGCTGGGGAACAGCAGG300                GGCAACACCTGGGCCAGGAGGAATCTGTACTACTCGCCCGACTCCGTCGAATTCTGGGCT360                TTCAGCTTTGACGAGATGGCTAAATATGACCTTCCCGCCACCATTGACTTCATCTTGAAG420                AAAACGGGACAGGACAAGCTACACTACGTTGGCCATTCCCAGGGCACCACCATTGGTTTC480                ATCGCCTTTTCCACCAATCCCAAGCTGGCGAAACGGATCAAAACCTTCTATGCATTAGCT540                CCCGTTGCCACCGTGAAGTACACCGAAACCCTGTTAAACAAACTCATGCTCGTCCCTTCG600                TTCCTCTTCAAGCTTATATTTGGAAACAAAATATTCTACCCACACCACTTCTTTGATCAA660                TTTCTCGCCACCGAGGTATGCTCCCGCGAGACGGTGGATCTCCTCTGCAGCAACGCCCTG720                TTTATCATTTGTGGATTTGACACTATGAACTTGAACATGAGTCGCTTGGATGTGTATCTG780                TCACATAATCCAGCAGGAACATCGGTTCAGAACGTGCTCCACTGGTCCCAGGCTGTTAAG840                TCTGGGAAGTTCCAAGCTTTTGACTGGGGAAGCCCAGTTCAGAACATGATGCACTATCAT900                CAGAGCATGCCTCCCTACTACAACCTGACAGACATGCATGTGCCAATCGCAGTGTGGAAC960                GGTGGCAACGACTTGCTGGCCGACCCTCACGATGTTGACCTTTTGCTTTCCAAGCTCCCC1020               AATCTCATTTACCACAGGAAGATTCCTCCTTACAATCACTTGGACTTTATCTGGGCCATG1080               GATGCCCCTCAAGCGGTTTACAATGAAATTGTTTCCATGATGGGAACAGATAATAAGTAG1140               TTCTAGATTTAAGGAATTATTCTTTTATTGTTCCAAAATACGTTCTTCTCTCACACGTGG1200               TTTTCTATCATGTTTGAGACACGGTGATTGTTCCCATGGTTTTGATTTCAGAAATGTGTT1260               AGCATCAACAATCTTTCCATTGGTAATTTTTGAATTTAAAATGATTTTTAAATTTGGGGC1320               ATCTGGGTGGCTCAGTTGGCTAAGTCGTCTGCCTTGGCTTAAGTCATGATCTCGGGGTCC1380               TAGGATGGAGCCTTGTGTCTGGGCTCCTGCCGGGGCGGGGGTCTGCTTCTCCTCCTGCTG1440               CTCCCCCCTGCTGCTGTGTGCACACACGCTCTCTCTCTCTCAAATAAATAAATAAATAAA1500               TACTTAATAAAATAAAAAAAAAAAAAAA1528                                               (2) INFORMATION FOR SEQ ID NO: 2:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1137 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..1137                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:                                       TTGTTTGGAAAATTACATCCCACAAACCCTGAAGTGACCATGAATATA48                             LeuPheGlyLysLeuHisProThrAsnProGluValThrMetAsnIle                               151015                                                                         AGTCAGATGATCACCTACTGGGGATACCCAGCTGAGGAATATGAAGTT96                             SerGlnMetIleThrTyrTrpGlyTyrProAlaGluGluTyrGluVal                               202530                                                                         GTGACCGAAGACGGTTATATCCTTGGGATCGACAGAATTCCTTATGGG144                            ValThrGluAspGlyTyrIleLeuGlyIleAspArgIleProTyrGly                               354045                                                                         AGGAAAAATTCAGAGAATATAGGCCGGAGACCTGTTGCATTTTTGCAA192                            ArgLysAsnSerGluAsnIleGlyArgArgProValAlaPheLeuGln                               505560                                                                         CACGGTTTGCTCGCATCAGCCACAAACTGGATCTCCAACCTGCCCAAC240                            HisGlyLeuLeuAlaSerAlaThrAsnTrpIleSerAsnLeuProAsn                               65707580                                                                       AACAGCCTGGCCTTCATCCTGGCCGACGCCGGGTACGACGTGTGGCTG288                            AsnSerLeuAlaPheIleLeuAlaAspAlaGlyTyrAspValTrpLeu                               859095                                                                         GGGAACAGCAGGGGCAACACCTGGGCCAGGAGGAATCTGTACTACTCG336                            GlyAsnSerArgGlyAsnThrTrpAlaArgArgAsnLeuTyrTyrSer                               100105110                                                                      CCCGACTCCGTCGAATTCTGGGCTTTCAGCTTTGACGAGATGGCTAAA384                            ProAspSerValGluPheTrpAlaPheSerPheAspGluMetAlaLys                               115120125                                                                      TATGACCTTCCCGCCACCATTGACTTCATCTTGAAGAAAACGGGACAG432                            TyrAspLeuProAlaThrIleAspPheIleLeuLysLysThrGlyGln                               130135140                                                                      GACAAGCTACACTACGTTGGCCATTCCCAGGGCACCACCATTGGTTTC480                            AspLysLeuHisTyrValGlyHisSerGlnGlyThrThrIleGlyPhe                               145150155160                                                                   ATCGCCTTTTCCACCAATCCCAAGCTGGCGAAACGGATCAAAACCTTC528                            IleAlaPheSerThrAsnProLysLeuAlaLysArgIleLysThrPhe                               165170175                                                                      TATGCATTAGCTCCCGTTGCCACCGTGAAGTACACCGAAACCCTGTTA576                            TyrAlaLeuAlaProValAlaThrValLysTyrThrGluThrLeuLeu                               180185190                                                                      AACAAACTCATGCTCGTCCCTTCGTTCCTCTTCAAGCTTATATTTGGA624                            AsnLysLeuMetLeuValProSerPheLeuPheLysLeuIlePheGly                               195200205                                                                      AACAAAATATTCTACCCACACCACTTCTTTGATCAATTTCTCGCCACC672                            AsnLysIlePheTyrProHisHisPhePheAspGlnPheLeuAlaThr                               210215220                                                                      GAGGTATGCTCCCGCGAGACGGTGGATCTCCTCTGCAGCAACGCCCTG720                            GluValCysSerArgGluThrValAspLeuLeuCysSerAsnAlaLeu                               225230235240                                                                   TTTATCATTTGTGGATTTGACACTATGAACTTGAACATGAGTCGCTTG768                            PheIleIleCysGlyPheAspThrMetAsnLeuAsnMetSerArgLeu                               245250255                                                                      GATGTGTATCTGTCACATAATCCAGCAGGAACATCGGTTCAGAACGTG816                            AspValTyrLeuSerHisAsnProAlaGlyThrSerValGlnAsnVal                               260265270                                                                      CTCCACTGGTCCCAGGCTGTTAAGTCTGGGAAGTTCCAAGCTTTTGAC864                            LeuHisTrpSerGlnAlaValLysSerGlyLysPheGlnAlaPheAsp                               275280285                                                                      TGGGGAAGCCCAGTTCAGAACATGATGCACTATCATCAGAGCATGCCT912                            TrpGlySerProValGlnAsnMetMetHisTyrHisGlnSerMetPro                               290295300                                                                      CCCTACTACAACCTGACAGACATGCATGTGCCAATCGCAGTGTGGAAC960                            ProTyrTyrAsnLeuThrAspMetHisValProIleAlaValTrpAsn                               305310315320                                                                   GGTGGCAACGACTTGCTGGCCGACCCTCACGATGTTGACCTTTTGCTT1008                           GlyGlyAsnAspLeuLeuAlaAspProHisAspValAspLeuLeuLeu                               325330335                                                                      TCCAAGCTCCCCAATCTCATTTACCACAGGAAGATTCCTCCTTACAAT1056                           SerLysLeuProAsnLeuIleTyrHisArgLysIleProProTyrAsn                               340345350                                                                      CACTTGGACTTTATCTGGGCCATGGATGCCCCTCAAGCGGTTTACAAT1104                           HisLeuAspPheIleTrpAlaMetAspAlaProGlnAlaValTyrAsn                               355360365                                                                      GAAATTGTTTCCATGATGGGAACAGATAATAAG1137                                          GluIleValSerMetMetGlyThrAspAsnLys                                              370375                                                                         (2) INFORMATION FOR SEQ ID NO: 3:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 379 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:                                       LeuPheGlyLysLeuHisProThrAsnProGluValThrMetAsnIle                               151015                                                                         SerGlnMetIleThrTyrTrpGlyTyrProAlaGluGluTyrGluVal                               202530                                                                         ValThrGluAspGlyTyrIleLeuGlyIleAspArgIleProTyrGly                               354045                                                                         ArgLysAsnSerGluAsnIleGlyArgArgProValAlaPheLeuGln                               505560                                                                         HisGlyLeuLeuAlaSerAlaThrAsnTrpIleSerAsnLeuProAsn                               65707580                                                                       AsnSerLeuAlaPheIleLeuAlaAspAlaGlyTyrAspValTrpLeu                               859095                                                                         GlyAsnSerArgGlyAsnThrTrpAlaArgArgAsnLeuTyrTyrSer                               100105110                                                                      ProAspSerValGluPheTrpAlaPheSerPheAspGluMetAlaLys                               115120125                                                                      TyrAspLeuProAlaThrIleAspPheIleLeuLysLysThrGlyGln                               130135140                                                                      AspLysLeuHisTyrValGlyHisSerGlnGlyThrThrIleGlyPhe                               145150155160                                                                   IleAlaPheSerThrAsnProLysLeuAlaLysArgIleLysThrPhe                               165170175                                                                      TyrAlaLeuAlaProValAlaThrValLysTyrThrGluThrLeuLeu                               180185190                                                                      AsnLysLeuMetLeuValProSerPheLeuPheLysLeuIlePheGly                               195200205                                                                      AsnLysIlePheTyrProHisHisPhePheAspGlnPheLeuAlaThr                               210215220                                                                      GluValCysSerArgGluThrValAspLeuLeuCysSerAsnAlaLeu                               225230235240                                                                   PheIleIleCysGlyPheAspThrMetAsnLeuAsnMetSerArgLeu                               245250255                                                                      AspValTyrLeuSerHisAsnProAlaGlyThrSerValGlnAsnVal                               260265270                                                                      LeuHisTrpSerGlnAlaValLysSerGlyLysPheGlnAlaPheAsp                               275280285                                                                      TrpGlySerProValGlnAsnMetMetHisTyrHisGlnSerMetPro                               290295300                                                                      ProTyrTyrAsnLeuThrAspMetHisValProIleAlaValTrpAsn                               305310315320                                                                   GlyGlyAsnAspLeuLeuAlaAspProHisAspValAspLeuLeuLeu                               325330335                                                                      SerLysLeuProAsnLeuIleTyrHisArgLysIleProProTyrAsn                               340345350                                                                      HisLeuAspPheIleTrpAlaMetAspAlaProGlnAlaValTyrAsn                               355360365                                                                      GluIleValSerMetMetGlyThrAspAsnLys                                              370375                                                                         (2) INFORMATION FOR SEQ ID NO: 4:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1140 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..1140                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:                                       ATGTTGTTTGGAAAATTACATCCCACAAACCCTGAAGTGACCATGAAT48                             MetLeuPheGlyLysLeuHisProThrAsnProGluValThrMetAsn                               151015                                                                         ATAAGTCAGATGATCACCTACTGGGGATACCCAGCTGAGGAATATGAA96                             IleSerGlnMetIleThrTyrTrpGlyTyrProAlaGluGluTyrGlu                               202530                                                                         GTTGTGACCGAAGACGGTTATATCCTTGGGATCGACAGAATTCCTTAT144                            ValValThrGluAspGlyTyrIleLeuGlyIleAspArgIleProTyr                               354045                                                                         GGGAGGAAAAATTCAGAGAATATAGGCCGGAGACCTGTTGCATTTTTG192                            GlyArgLysAsnSerGluAsnIleGlyArgArgProValAlaPheLeu                               505560                                                                         CAACACGGTTTGCTCGCATCAGCCACAAACTGGATCTCCAACCTGCCC240                            GlnHisGlyLeuLeuAlaSerAlaThrAsnTrpIleSerAsnLeuPro                               65707580                                                                       AACAACAGCCTGGCCTTCATCCTGGCCGACGCCGGGTACGACGTGTGG288                            AsnAsnSerLeuAlaPheIleLeuAlaAspAlaGlyTyrAspValTrp                               859095                                                                         CTGGGGAACAGCAGGGGCAACACCTGGGCCAGGAGGAATCTGTACTAC336                            LeuGlyAsnSerArgGlyAsnThrTrpAlaArgArgAsnLeuTyrTyr                               100105110                                                                      TCGCCCGACTCCGTCGAATTCTGGGCTTTCAGCTTTGACGAGATGGCT384                            SerProAspSerValGluPheTrpAlaPheSerPheAspGluMetAla                               115120125                                                                      AAATATGACCTTCCCGCCACCATTGACTTCATCTTGAAGAAAACGGGA432                            LysTyrAspLeuProAlaThrIleAspPheIleLeuLysLysThrGly                               130135140                                                                      CAGGACAAGCTACACTACGTTGGCCATTCCCAGGGCACCACCATTGGT480                            GlnAspLysLeuHisTyrValGlyHisSerGlnGlyThrThrIleGly                               145150155160                                                                   TTCATCGCCTTTTCCACCAATCCCAAGCTGGCGAAACGGATCAAAACC528                            PheIleAlaPheSerThrAsnProLysLeuAlaLysArgIleLysThr                               165170175                                                                      TTCTATGCATTAGCTCCCGTTGCCACCGTGAAGTACACCGAAACCCTG576                            PheTyrAlaLeuAlaProValAlaThrValLysTyrThrGluThrLeu                               180185190                                                                      TTAAACAAACTCATGCTCGTCCCTTCGTTCCTCTTCAAGCTTATATTT624                            LeuAsnLysLeuMetLeuValProSerPheLeuPheLysLeuIlePhe                               195200205                                                                      GGAAACAAAATATTCTACCCACACCACTTCTTTGATCAATTTCTCGCC672                            GlyAsnLysIlePheTyrProHisHisPhePheAspGlnPheLeuAla                               210215220                                                                      ACCGAGGTATGCTCCCGCGAGACGGTGGATCTCCTCTGCAGCAACGCC720                            ThrGluValCysSerArgGluThrValAspLeuLeuCysSerAsnAla                               225230235240                                                                   CTGTTTATCATTTGTGGATTTGACACTATGAACTTGAACATGAGTCGC768                            LeuPheIleIleCysGlyPheAspThrMetAsnLeuAsnMetSerArg                               245250255                                                                      TTGGATGTGTATCTGTCACATAATCCAGCAGGAACATCGGTTCAGAAC816                            LeuAspValTyrLeuSerHisAsnProAlaGlyThrSerValGlnAsn                               260265270                                                                      GTGCTCCACTGGTCCCAGGCTGTTAAGTCTGGGAAGTTCCAAGCTTTT864                            ValLeuHisTrpSerGlnAlaValLysSerGlyLysPheGlnAlaPhe                               275280285                                                                      GACTGGGGAAGCCCAGTTCAGAACATGATGCACTATCATCAGAGCATG912                            AspTrpGlySerProValGlnAsnMetMetHisTyrHisGlnSerMet                               290295300                                                                      CCTCCCTACTACAACCTGACAGACATGCATGTGCCAATCGCAGTGTGG960                            ProProTyrTyrAsnLeuThrAspMetHisValProIleAlaValTrp                               305310315320                                                                   AACGGTGGCAACGACTTGCTGGCCGACCCTCACGATGTTGACCTTTTG1008                           AsnGlyGlyAsnAspLeuLeuAlaAspProHisAspValAspLeuLeu                               325330335                                                                      CTTTCCAAGCTCCCCAATCTCATTTACCACAGGAAGATTCCTCCTTAC1056                           LeuSerLysLeuProAsnLeuIleTyrHisArgLysIleProProTyr                               340345350                                                                      AATCACTTGGACTTTATCTGGGCCATGGATGCCCCTCAAGCGGTTTAC1104                           AsnHisLeuAspPheIleTrpAlaMetAspAlaProGlnAlaValTyr                               355360365                                                                      AATGAAATTGTTTCCATGATGGGAACAGATAATAAG1140                                       AsnGluIleValSerMetMetGlyThrAspAsnLys                                           370375380                                                                      (2) INFORMATION FOR SEQ ID NO: 5:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 380 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:                                       MetLeuPheGlyLysLeuHisProThrAsnProGluValThrMetAsn                               151015                                                                         IleSerGlnMetIleThrTyrTrpGlyTyrProAlaGluGluTyrGlu                               202530                                                                         ValValThrGluAspGlyTyrIleLeuGlyIleAspArgIleProTyr                               354045                                                                         GlyArgLysAsnSerGluAsnIleGlyArgArgProValAlaPheLeu                               505560                                                                         GlnHisGlyLeuLeuAlaSerAlaThrAsnTrpIleSerAsnLeuPro                               65707580                                                                       AsnAsnSerLeuAlaPheIleLeuAlaAspAlaGlyTyrAspValTrp                               859095                                                                         LeuGlyAsnSerArgGlyAsnThrTrpAlaArgArgAsnLeuTyrTyr                               100105110                                                                      SerProAspSerValGluPheTrpAlaPheSerPheAspGluMetAla                               115120125                                                                      LysTyrAspLeuProAlaThrIleAspPheIleLeuLysLysThrGly                               130135140                                                                      GlnAspLysLeuHisTyrValGlyHisSerGlnGlyThrThrIleGly                               145150155160                                                                   PheIleAlaPheSerThrAsnProLysLeuAlaLysArgIleLysThr                               165170175                                                                      PheTyrAlaLeuAlaProValAlaThrValLysTyrThrGluThrLeu                               180185190                                                                      LeuAsnLysLeuMetLeuValProSerPheLeuPheLysLeuIlePhe                               195200205                                                                      GlyAsnLysIlePheTyrProHisHisPhePheAspGlnPheLeuAla                               210215220                                                                      ThrGluValCysSerArgGluThrValAspLeuLeuCysSerAsnAla                               225230235240                                                                   LeuPheIleIleCysGlyPheAspThrMetAsnLeuAsnMetSerArg                               245250255                                                                      LeuAspValTyrLeuSerHisAsnProAlaGlyThrSerValGlnAsn                               260265270                                                                      ValLeuHisTrpSerGlnAlaValLysSerGlyLysPheGlnAlaPhe                               275280285                                                                      AspTrpGlySerProValGlnAsnMetMetHisTyrHisGlnSerMet                               290295300                                                                      ProProTyrTyrAsnLeuThrAspMetHisValProIleAlaValTrp                               305310315320                                                                   AsnGlyGlyAsnAspLeuLeuAlaAspProHisAspValAspLeuLeu                               325330335                                                                      LeuSerLysLeuProAsnLeuIleTyrHisArgLysIleProProTyr                               340345350                                                                      AsnHisLeuAspPheIleTrpAlaMetAspAlaProGlnAlaValTyr                               355360365                                                                      AsnGluIleValSerMetMetGlyThrAspAsnLys                                           370375380                                                                      (2) INFORMATION FOR SEQ ID NO: 6:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1146 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:                                       TTGTTTGGAAAATTACATCCCACAAACCCTGAAGTGACCATGAATATAAGTCAGATGATC60                 ACCTACTGGGGATACCCAGCTGAGGAATATGAAGTTGTGACCGAAGACGGTTATATCCTT120                GGGATCGACAGAATTCCTTATGGGAGGAAAAATTCAGAGAATATAGGCCGGAGACCTGTT180                GCATTTTTGCAACACGGTTTGCTCGCATCAGCCACAAACTGGATCTCCAACCTGCCCAAC240                AACAGCCTGGCCTTCATCCTGGCCGACGCCGGGTACGACGTGTGGCTGGGGAACAGCAGG300                GGCAACACCTGGGCCAGGAGGAATCTGTACTACTCGCCCGACTCCGTCGAATTCTGGGCT360                TTCAGCTTTGACGAGATGGCTAAATATGACCTTCCCGCCACCATTGACTTCATCTTGAAG420                AAAACGGGACAGGACAAGCTACACTACGTTGGCCATTCCCAGGGCACCACCATTGGTTTC480                ATCGCCTTTTCCACCAATCCCAAGCTGGCGAAACGGATCAAAACCTTCTATGCATTAGCT540                CCCGTTGCCACCGTGAAGTACACCGAAACCCTGTTAAACAAACTCATGCTCGTCCCTTCG600                TTCCTCTTCAAGCTTATATTTGGAAACAAAATATTCTACCCACACCACTTCTTTGATCAA660                TTTCTCGCCACCGAGGTATGCTCCCGCGAGACGGTGGATCTCCTCTGCAGCAACGCCCTG720                TTTATCATTTGTGGATTTGACACTATGAACTTGAACATGAGTCGCTTGGATGTGTATCTG780                TCACATAATCCAGCAGGAACATCGGTTCAGAACGTGCTCCACTGGTCCCAGGCTGTTAAG840                TCTGGGAAGTTCCAAGCTTTTGACTGGGGAAGCCCAGTTCAGAACATGATGCACTATCAT900                CAGAGCATGCCTCCCTACTACAACCTGACAGACATGCATGTGCCAATCGCAGTGTGGAAC960                GGTGGCAACGACTTGCTGGCCGACCCTCACGATGTTGACCTTTTGCTTTCCAAGCTCCCC1020               AATCTCATTTACCACAGGAAGATTCCTCCTTACAATCACTTGGACTTTATCTGGGCCATG1080               GATGCCCCTCAAGCGGTTTACAATGAAATTGTTTCCATGATGGGAACAGATAATAAGTAG1140               TTCTAG1146                                                                     (2) INFORMATION FOR SEQ ID NO: 7:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 23 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:                                       GGGCACATGGTTTGTTTGGAAAA23                                                      (2) INFORMATION FOR SEQ ID NO: 8:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 17 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:                                       ACTACTATCACGTAGTA17                                                            (2) INFORMATION FOR SEQ ID NO: 9:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 32 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:                                       GGAGATCTAACACCATGTTGTTTGGAAAATTA32                                             (2) INFORMATION FOR SEQ ID NO: 10:                                             (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 23 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:                                      GCGTCGACGATAGAAAACCACGT23                                                      (2) INFORMATION FOR SEQ ID NO: 11:                                             (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 4 amino acids                                                      (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:                                      LeuPheGlyLys                                                                   (2) INFORMATION FOR SEQ ID NO:12:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7 amino acids                                                      (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                       ThrHisGlyLeuPheGlyLys                                                          15                                                                             (2) INFORMATION FOR SEQ ID NO:13:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7 amino acids                                                      (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                       ThrHisGlyLeuPheGlyLys                                                          15                                                                             (2) INFORMATION FOR SEQ ID NO:14:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7 amino acids                                                      (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                       AlaHisGlyLeuPheGlyLys                                                          15                                                                             (2) INFORMATION FOR SEQ ID NO:15:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 161 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                       GAATTCAGTATTGACAATTTATACATCGATATGGTATAATGTGTGGAATTGTGAGCGGAT60                 AACAATTTCACACAGGAGATCTGCAGGTAAGCTTCAGCTGGGATCCTCTAGAGTCGACGT120                GAAAAATGGCGCACATTGTGCGACATTTTTTTTGTCATATG161                                   (2) INFORMATION FOR SEQ ID NO:16:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 380 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                       LeuPheGlyLysLeuHisProGlySerProGluValThrMetAsnIle                               151015                                                                         SerGlnMetIleThrTyrTrpGlyTyrProAsnGluGluTyrGluVal                               202530                                                                         ValThrGluAspGlyTyrIleLeuGluValAsnArgIleProTyrGly                               354045                                                                         LysLysAsnSerGlyAsnThrGlyGlnArgProValValPheLeuGln                               505560                                                                         HisGlyLeuLeuAlaSerAlaThrAsnTrpIleSerAsnLeuProAsn                               65707580                                                                       AsnSerLeuAlaPheIleLeuAlaAspAlaGlyTyrAspValTrpLeu                               859095                                                                         GlyAsnSerArgGlyAsnThrTrpAlaArgArgAsnLeuTyrTyrSer                               100105110                                                                      ProAspSerValGluPheTrpAlaAlaPheSerPheAspGluMetAla                               115120125                                                                      LysTyrAspLeuProAlaThrIleAspPheIleValLysLysThrGly                               130135140                                                                      GlnLysGlnLeuHisTyrValGlyHisSerGlnGlyThrThrIleGly                               145150155160                                                                   PheIleAlaPheSerThrAsnProSerLeuAlaLysArgIleLysThr                               165170175                                                                      PheTyrAlaLeuAlaProValAlaThrValLysTyrThrLysSerLeu                               180185190                                                                      IleAsnLysLeuArgPheValProGlnSerLeuPheLysPheIlePhe                               195200205                                                                      GlyAspLysIlePheTyrProHisAsnPhePheAspGlnPheLeuAla                               210215220                                                                      ThrGluValCysSerArgGluMetLeuAsnLeuLeuCysSerAsnAla                               225230235240                                                                   LeuPheIleIleCysGlyPheAspSerLysAsnPheAsnThrSerArg                               245250255                                                                      LeuAspValTyrLeuSerHisAsnProAlaGlyThrSerValGlnAsn                               260265270                                                                      MetPheHisTrpThrGlnAlaValLysSerGlyLysPheGlnAlaTyr                               275280285                                                                      AspTrpGlySerProValGlnAsnArgMetHisTyrAspGlnSerGln                               290295300                                                                      ProProTyrTyrAsnValThrAlaMetAsnValProIleAlaValTrp                               305310315320                                                                   AsnGlyGlyLysAspLeuLeuAlaAspProGlnAspValGlyLeuLeu                               325330335                                                                      LeuProLysLeuProAsnLeuIleTyrHisLysGluIleProPheTyr                               340345350                                                                      AsnHisLeuAspPheIleTrpAlaMetAspAlaProGlnGluValTyr                               355360365                                                                      AsnAspIleValSerMetIleSerGluAspLysLys                                           370375380                                                                      (2) INFORMATION FOR SEQ ID NO:17:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 377 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                       LeuPheGlyLysLeuGlyProGlyAsnProGluAlaAsnMetAsnIle                               151015                                                                         SerGlnMetIleThrTyrTrpGlyTyrProCysGlnGluTyrGluVal                               202530                                                                         ValThrGluAspGlyTyrIleLeuGlyValTyrArgIleProHisGly                               354045                                                                         LysAsnAsnSerGluAsnIleGlyLysArgProValValTyrLeuGln                               505560                                                                         HisGlyLeuIleAlaSerAlaThrAsnTrpIleAlaAsnLeuProAsn                               65707580                                                                       AsnSerLeuAlaPheMetLeuAlaAspAlaGlyTyrAspValTrpLeu                               859095                                                                         GlyAsnSerArgGlyAsnThrTrpSerArgLysAsnValTyrTyrSer                               100105110                                                                      ProAspSerValGluPheTrpAlaPheSerPheAspGluMetAlaLys                               115120125                                                                      TyrAspLeuProAlaThrIleAsnPheIleValGlnLysThrGlyGln                               130135140                                                                      GluLysIleHisTyrValGlyHisSerGlnGlyThrThrIleGlyPhe                               145150155160                                                                   IleAlaPheSerThrAsnProThrLeuAlaLysLysIleLysThrPhe                               165170175                                                                      TyrAlaLeuAlaProValAlaThrValLysTyrThrGlnSerProLeu                               180185190                                                                      LysLysIleSerPheIleProThrPheLeuPheLysLeuMetPheGly                               195200205                                                                      LysLysMetPheLeuProHisThrTyrPheAspAspPheLeuGlyThr                               210215220                                                                      GluValCysSerArgGluValLeuAspLeuLeuCysSerAsnThrLeu                               225230235240                                                                   PheIlePheCysGlyPheAspLysLysAsnLeuAsnValSerArgPhe                               245250255                                                                      AspValTyrLeuGlyHisAsnProAlaGlyThrSerValGlnAspPhe                               260265270                                                                      LeuHisTrpAlaGlnLeuValArgSerGlyLysPheGlnAlaPheAsn                               275280285                                                                      TrpGlySerProSerGlnAsnMetLeuHisTyrAsnGlnLysThrPro                               290295300                                                                      ProGluTyrAspValSerAlaMetThrValProValAlaValTrpAsn                               305310315320                                                                   GlyGlyAsnAspIleLeuAlaAspProGlnAspValAlaMetLeuLeu                               325330335                                                                      ProLysLeuSerAsnLeuLeuPheHisLysGluIleLeuAlaTyrAsn                               340345350                                                                      HisLeuAspPheIleTrpAlaMetAspAlaProGlnGluValTyrAsn                               355360365                                                                      GluMetIleSerMetMetAlaGluAsp                                                    370375                                                                         (2) INFORMATION FOR SEQ ID NO:18:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 379 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                       LeuPheGlyLysSerAlaProThrAsnProGluValAsnMetAsnIle                               151015                                                                         SerGlnMetIleSerTyrTrpGlyTyrProSerGluLysTyrGluVal                               202530                                                                         ValThrGluAspGlyTyrIleLeuGluValAsnArgIleProTyrGly                               354045                                                                         LysLysAsnSerGlyAsnArgGlyGlnArgProValValPheLeuGln                               505560                                                                         HisGlyLeuLeuAlaSerAlaSerAsnTrpIleSerAsnLeuProAsn                               65707580                                                                       AsnSerLeuAlaPheIleLeuAlaAspAlaGlyTyrGlyValTrpLeu                               859095                                                                         GlyAsnSerArgGlyAsnThrTrpSerArgArgAsnLeuTyrTyrSer                               100105110                                                                      ProAspSerValGluPheTrpAlaPheSerPheAspGluMetAlaLys                               115120125                                                                      TyrAspLeuProAlaThrIleAspPheIleValLysGluThrGlyGln                               130135140                                                                      GluLysLeuHisTyrValGlyHisSerGlnGlyThrThrIleGlyPhe                               145150155160                                                                   IleAlaPheSerThrAsnProLysLeuAlaGluArgIleLysThrPhe                               165170175                                                                      TyrAlaLeuAlaProValAlaThrValLysTyrThrLysSerLeuVal                               180185190                                                                      AsnLysLeuArgPheIleProProThrMetPheLysIleIlePheGly                               195200205                                                                      AspLysIlePheTyrProHisAsnPhePheAspGlnPheLeuAlaThr                               210215220                                                                      GlnValCysSerArgGluThrLeuAsnValIleCysSerAsnAlaLeu                               225230235240                                                                   PheIleIleCysGlyPheAspSerAlaAsnLeuAsnMetSerArgLeu                               245250255                                                                      AspValTyrValSerHisAsnProAlaGlyThrSerValGlnAsnMet                               260265270                                                                      LeuHisTrpThrGlnAlaValLysSerGlyAsnPheGlnAlaPheAsn                               275280285                                                                      TrpGlySerProAlaGlnAsnValValHisPheAsnGlnProThrPro                               290295300                                                                      ProTyrTyrAsnValThrAlaMetAsnValProIleAlaValTrpSer                               305310315320                                                                   GlyGlyAsnAspTrpLeuAlaAspProGlnAspValAspLeuLeuLeu                               325330335                                                                      ProLysLeuSerAsnLeuIleTyrHisLysGluIleLeuProTyrAsn                               340345350                                                                      HisLeuAspPheIleTrpAlaMetAsnAlaProGlnGluValTyrAsn                               355360365                                                                      GluIleIleSerMetMetAlaLysAspLysLys                                              370375                                                                         (2) INFORMATION FOR SEQ ID NO:19:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                       CACATGGTTTGTTTGGAAAA20                                                         (2) INFORMATION FOR SEQ ID NO:20:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                       CACATGGTCTTTTTGGAAAA20                                                         (2) INFORMATION FOR SEQ ID NO:21:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                       CACATGGCCTATTTGGAAAA20                                                         __________________________________________________________________________ 

We claim:
 1. Nucleic acid characterized in that it is constituted by a first DNA fragment represented in FIG. 8 (SEQ ID NO 1), or a second DNA fragment delimited by the nucleotides situated at positions 1 and 1137 (SEQ ID NO 2) of the DNA represented in FIG. 8, wherein either of the first or second DNA fragments encodes the polypeptide delimited by the amino acids situated at positions 1 and 379 (SEQ ID NO 3) of the amino acid sequence represented in FIG. 9A, this polypeptide corresponding to dog gastric lipase.
 2. Nucleic acid according to claim 1, characterized in that it comprises upstream of the DNA fragment delimited by the nucleotides situated at positions 1 and 1137, a DNA fragment encoding a methionine (SEQ ID NO 4).
 3. Nucleic acid characterized in that it comprises upstream of one of the DNA fragments according to claim 1, a nucleotide sequence encoding a signal peptide.
 4. Nucleic acid characterized in that it comprises either of the two complementary nucleotide sequences constituting the DNA fragments according to one of claim
 1. 5. Nucleic acid characterized in that it hybridizes under conditions of a 2× SSC buffer--0.1% SDS for 2 hours at 50° C. a nucleic acid according to claim
 4. 6. Nucleic acid encoding a polypeptide as defined in any one of claims 1 to 5, and whose nucleotide sequence differs, according to the degeneracy of the genetic code, from the nucleotide sequences defined therein.
 7. Recombinant nucleic acid characterized in that it comprises a nucleic acid according to one of claim 1 inserted into a nucleotide sequence which is heterologous with respect to the abovementioned nucleic acid.
 8. Recombinant nucleic acid according to claim 7, characterized in that it comprises a promoter situated upstream of the nucleic acid under whose control said nucleic acid is transcribed, as well as a sequence encoding signals for termination of transcription which is situated downstream of the said nucleic acid.
 9. Recombinant vector characterized in that it comprises a recombinant nucleic acid according to claim
 7. 10. Recombinant vector characterized in that it comprises elements necessary for promoting and controlling the expression of the nucleic acid of claim 7, in a host cell, and a promoter recognized by the polymerases of the host cell.
 11. Host cell, of the prokaryotic or eukaryotic type, which is transformed by a recombinant vector comprising a recombinant nucleic acid according to claim 7 and comprising the regulatory elements permitting the expression of the nucleic acid.
 12. Process for the preparation of a polypeptide encoded by a nucleic acid, comprising the following steps:culturing a host cell according to claim 11, in an appropriate culture medium, and recovering the polypeptide produced by the said host cell, either directly from the abovementioned culture medium, or after lysis of the host cell.
 13. The process of claim 12, wherein the polypeptide is encoded by a nucleic acid characterized in that it is constituted by a first DNA fragment represented in FIG. 8 (SEQ ID NO 1), or a second DNA fragment delimited by the nucleotides situated at positions 1 and 1137 (SEQ ID NO 2) of the DNA represented in FIG. 8, wherein either of the first or second DNA fragments encodes the polypeptide delimited by the amino acids situated at positions 1 and 379 (SEQ ID NO 3) of the amino acid sequence represented in FIG. 9A, this polypeptide corresponding to dog gastric lipase.
 14. The process of claim 12, the polypeptide corresponding, according to the universal genetic code, to the nucleic acid encoding the amino acids situated at positions 1 and 379 of the amino acid sequence represented in FIG. 9A (SEQ ID NO 3).
 15. The process of claim 12, the polypeptide corresponding, according to the universal genetic code, to the nucleic acid encoding the amino acid sequence delimited by the amino acids situated at positions 1 and 379 of FIG. 9A, and preceded by a methionine at the NH₂ end (SEQ ID NO 5).
 16. Recombinant vector according to claim 10, the promoter of which is an inducible promoter. 